Incremental Cosine Computations for Search and Exploration of Tag Spaces

Authors

  • Raymond Vermaas
  • Damir Vandic
  • Flavius Frasincar
Abstract

Tags are often used to describe user-generated content on the Web. However, existing Web applications do not process new tag information incrementally, which hurts their scalability. Since the cosine similarity between tags, represented as co-occurrence vectors, is an important component of such applications, we propose two approaches for the incremental computation of cosine similarities. The first approach recalculates the cosine similarity for new tag pairs and for existing tag pairs whose co-occurrences have changed. The second approach computes the cosine similarity between two tags by reusing, if available, the previous cosine similarity between these tags. Both approaches yield the same cosine values as a complete recalculation of the cosine similarities. Our experiments show that the proposed approaches are between 1.2 and 23 times faster than a complete recalculation, depending on the number of co-occurrence changes and new tags.
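
The idea of reusing a previous cosine value can be illustrated with a minimal Python sketch. It is not the authors' algorithm, whose exact formulation is not given in the abstract; it only assumes that the squared vector norms are cached alongside each cosine value, so that the old dot product can be recovered and adjusted when a single co-occurrence count changes. The function names `cosine` and `update_cosine` and all parameter names are hypothetical.

```python
import math


def cosine(u, v):
    """Full cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def update_cosine(old_cos, norm_u_sq, norm_v_sq, u_k, v_k, delta):
    """Update cos(u, v) after component k of u grows by delta.

    Assumption (not from the paper): the caller caches the squared norms
    of u and v, and passes the values of u[k] and v[k] before the change.
    Returns the new cosine and the new squared norm of u.
    """
    # Recover the old dot product from the cached cosine and norms.
    old_dot = old_cos * math.sqrt(norm_u_sq * norm_v_sq)
    # Only component k of u changed, so the dot product shifts by delta * v[k].
    new_dot = old_dot + delta * v_k
    # (u_k + delta)^2 - u_k^2 = 2 * delta * u_k + delta^2
    new_norm_u_sq = norm_u_sq + 2.0 * delta * u_k + delta * delta
    if new_norm_u_sq == 0.0 or norm_v_sq == 0.0:
        return 0.0, new_norm_u_sq
    return new_dot / math.sqrt(new_norm_u_sq * norm_v_sq), new_norm_u_sq


# Hypothetical co-occurrence vectors for two tags.
u = {"x": 2.0, "y": 1.0}
v = {"x": 1.0, "z": 3.0}
old_cos = cosine(u, v)
norm_u_sq = sum(w * w for w in u.values())
norm_v_sq = sum(w * w for w in v.values())

# The co-occurrence count u["x"] increases by 3.
new_cos, new_norm_u_sq = update_cosine(
    old_cos, norm_u_sq, norm_v_sq, u["x"], v["x"], 3.0
)
u["x"] += 3.0
```

In this sketch the update touches only cached scalars, so its cost does not depend on the vector length, whereas `cosine` iterates over all non-zero components. Up to floating-point rounding, the updated value matches a full recomputation on the modified vector.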

Similar resources

Improving Search and Exploration in Tag Spaces Using Automated Tag Clustering

In recent years we have experienced an increase in the usage of tags to describe resources. However, the free nature of tagging presents some challenges regarding the search and exploration of tag spaces. In order to deal with these challenges we propose the Semantic Tag Clustering Search (STCS) framework. The framework first groups syntactic variations using several measures based on the Leven...

A semantic-based approach for searching and browsing tag spaces

In this thesis we propose the Semantic Tag Clustering Search framework (STCS). This framework consists of three parts. The first part deals with syntactic variations by clustering tags that are syntactic variations of each other and assigning a label to them. The second part of the framework addresses the problem of recognizing homonyms and identifying semantically related tags. The last, and f...

Scaling Pair-Wise Similarity-Based Algorithms in Tagging Spaces

Users of Web tag spaces, e.g., Flickr, find it difficult to get adequate search results due to syntactic and semantic tag variations. In most approaches that address this problem, the cosine similarity between tags plays a major role. However, the use of this similarity introduces a scalability problem as the number of similarities that need to be computed grows quadratically with the number of...

Shear Waves Through Non Planar Interface Between Anisotropic Inhomogeneous and Visco-Elastic Half-Spaces

A problem of reflection and transmission of a plane shear wave incident at a corrugated interface between transversely isotropic inhomogeneous and visco-elastic half-spaces is investigated. Applying appropriate boundary conditions and using Rayleigh’s method of approximation expressions for reflection and transmission coefficients are obtained for the first and second order approximation of the...

A Cluster-Based Approach for Search and Exploration of Tag Spaces

Although Semantic Web technology is increasingly becoming more and more important, tagging remains a popular method to describe Web resources. Therefore it is important to address the issues that are found in current tagging search engines, such as Flickr. We find that the free nature of tagging results in many issues for tag search engines, such as synonyms, homonyms, syntactic variations, etc...

Publication date: 2012